Coded Speech Quality Measurement by a Non-Intrusive PESQ-DNN
نویسندگان
چکیده
Wideband codecs such as AMR-WB or EVS are widely used in (mobile) speech communication. Evaluation of coded quality is often performed subjectively by an absolute category rating (ACR) listening test. However, the ACR test impractical for online monitoring communication networks. Perceptual evaluation (PESQ) one metrics instrumentally predicting results PESQ algorithm requires original reference signal, which usually unavailable network monitoring, thus limiting its applicability. NISQA a new non-intrusive neural-network-based measure, focusing on super-wideband signals. In this work, however, we aim at well-known metric using xmlns:xlink="http://www.w3.org/1999/xlink">PESQ-DNN model. We illustrate potential model scores wideband-coded obtained from operating different bitrates noisy, tandeming, and error-prone transmission conditions. compare our methods with state-of-the-art topologies xmlns:xlink="http://www.w3.org/1999/xlink">QualityNet , xmlns:xlink="http://www.w3.org/1999/xlink">WaweNet xmlns:xlink="http://www.w3.org/1999/xlink">DNSMOS —all applied to prediction—by measuring mean error (MAE) linear correlation coefficient (LCC). The proposed offers best total MAE LCC 0.11 0.92, respectively, conditions without frame loss, still when including loss. Note that could be similarly non-intrusively predict POLQA other (intrusive) metrics. definition code provided https://github.com/ifnspaml/PESQDNN .
منابع مشابه
A Bayesian approach to non-intrusive quality assessment of speech
A Bayesian approach to non-intrusive quality assessment of narrow-band speech is presented. The speech features used to assess quality are the sample mean and variance of bandpowers evaluated from the temporal envelope in the channels of an auditory filter-bank. Bayesian multivariate adaptive regression splines (BMARS) is used to map features into quality ratings. The proposed combination of fe...
متن کاملNon-intrusive speech quality assessment with low computational complexity
We describe an algorithm for monitoring subjective speech quality without access to the original signal that has very low computational and memory requirements. The features used in the proposed algorithm can be computed from commonly used speechcoding parameters. Reconstruction and perceptual transformation of the signal are not performed. The algorithm generates quality assessment ratings wit...
متن کاملPerceptually-based objective measure for non-intrusive speech quality assessment
This paper proposes a new perceptuallybased method for assessing speech quality and evaluates its performance. The method is based on comparing the received speech to an appropriate reference representing the closest match from a preformulated codebook. The codebook holds a number of optimally clustered speech parameter vectors extracted from a large number of various undistorted clean speech r...
متن کاملNon-Intrusive SOM-Based Speech Quality Assessment for Telephony Applications
A non-intrusive method for speech quality assessment in telephony applications is proposed and its performance evaluated. The method involves measuring perception-based objective auditory distances between the voiced parts of the processed (degraded) speech signal to appropriately matching references extracted from a pre-formulated codebook. The codebook is formed by optimally clustering large ...
متن کاملNon-intrusive Speech Quality Assessment in Simplified E-Model
The E-model brings a modern approach to the computation of estimated quality, allowing for easy implementation. One of its advantages is that it can be applied in real time. The method is based on a mathematical computation model evaluating transmission path impairments influencing speech signal, especially delays and packet losses. These parameters, common in an IP network, can affect speech q...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: IEEE/ACM transactions on audio, speech, and language processing
سال: 2023
ISSN: ['2329-9304', '2329-9290']
DOI: https://doi.org/10.1109/taslp.2023.3317574